Safe MBR-transformation in similar sequence matching

نویسندگان

  • Yang-Sae Moon
  • Byung Suk Lee
چکیده

When a multidimensional index is used for similar sequence matching, the traditional approach is to transform high-dimensional window sequences to low-dimensional sequences and bounding them into a low-dimensional minimum bounding rectangle (MBR). In this paper, we propose a new approach which constructs a low-dimensional MBR by directly transforming a high-dimensional MBR (calledMBR-transformation) bounding the high-dimensional sequences. This approach significantly reduces the number of lower-dimensional transformations needed in similar sequence matching. However, it poses a risk that some transformed sequences may fall outside the transformed lowdimensional MBR. We thus propose safe MBR-transformation which has the property that every possible transformed sequence is inside a safe MBR-transformed MBR. Then, considering the discrete Fourier transform (DFT) and the discrete Cosine transform (DCT), we prove that they are not safe as MBR-transformations, and modify them to become safe MBR-transformations (called mbrDFT if DFT-based and mbrDCT if DCT-based). Then, we prove the safeness and optimality of mbrDFT and mbrDCT. Analyses and experiments show that the mbrDFT and mbrDCT reduce the execution time by several orders of magnitude due to the reduction in the number of lower-dimensional transformations. The proposed safe MBR-transformation provides a useful framework for a variety of applications that require a direct transformation of a high-dimensional MBR to a low-dimensional MBR. Ó 2014 Elsevier Inc. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MBR-SIFT: A mirror reflected invariant feature descriptor using a binary representation for image matching

The traditional scale invariant feature transform (SIFT) method can extract distinctive features for image matching. However, it is extremely time-consuming in SIFT matching because of the use of the Euclidean distance measure. Recently, many binary SIFT (BSIFT) methods have been developed to improve matching efficiency; however, none of them is invariant to mirror reflection. To address these ...

متن کامل

EDAM: An Efficient Clique Discovery Algorithm with Frequency Transformation for Finding Motifs

Finding motifs in DNA sequences plays an important role in deciphering transcriptional regulatory mechanisms and drug target identification. In this paper, we propose an efficient algorithm, EDAM, for finding motifs based on frequency transformation and Minimum Bounding Rectangle (MBR) techniques. It works in three phases, frequency transformation, MBR-clique searching and motif discovery. In f...

متن کامل

Neighborhood matrix: A new idea in matching of two dimensional gel images

Automated data analysis and pattern recognition techniques are the requirements of biological and proteomicsresearch studies. The analysis of proteins consists of some stages among which the analysis of two dimensionalelectrophoresis (2-DE) images is crucial. The aim of image capturing is to generate a Photostat that can be used infuture works such as image comparison. The researchers introduce...

متن کامل

Change in the fouling propensity of sludge in membrane bioreactors (MBR) in relation to the accumulation of biopolymer clusters.

A membrane bioreactor (MBR) and an activated sludge process (ASP) were operated side by side to evaluate the change of sludge supernatant characteristics and the evolution of the sludge fouling propensity. The MBR sludge had a higher organic concentration and more biopolymer clusters (BPC) in the supernatant compared with ASP. BPC increased in both concentration and size in the MBR. The results...

متن کامل

ON THE DISTRIBUTION OF RANKED HEIGHTS OF EXCURSIONS OF A BROWNIAN BRIDGE1 By Jim Pitman and Marc Yor

The distribution of the sequence of ranked maximum and minimum values attained during excursions of a standard Brownian bridge Bbr t 0 ≤ t ≤ 1 is described. The height Mbr+ j of the jth highest maximum over a positive excursion of the bridge has the same distribution as Mbr+ 1 /j, where the distribution of Mbr+ 1 = sup0≤t≤1 Bbr t is given by Lévy’s formula P Mbr+ 1 > x = e−2x 2 . The probabilit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 270  شماره 

صفحات  -

تاریخ انتشار 2014